Characterization of MMX-enhanced DSP and Multimedia Applications on a General Purpose Processor
نویسندگان
چکیده
Proper use of native signal processing (NSP) instruction set enhancements can result in speedup for targeted applications. In this paper, we study the behavior of the X86 architecture's Multimedia Extension (MMX) instruction set on signal processing and multimedia algorithms and applications. In addition to quantifying speedup, we make comparisons based on detailed dynamic instruction pro ling. We compare a suite of digital signal processing (DSP) and multimedia programs implemented in C code and the same programs implemented with calls to an MMX assembly library that performs ltering, vector arithmetic, and other relevant kernels. As expected, our analysis shows decreased execution time for most, but not all, of our MMX programs compared to their unmodi ed equivalents. The observed speedup for the programs using MMX ranges from 1.2 to 7.5. For each set of programs, we perform a detailed instruction level analysis using VTune. This allows us to isolate the speci c reasons for speedup or lack thereof. This analysis allows one to understand which aspects of native signal processing are most useful and how it can be utilized most e ciently. L. John is supported in part by grants from the National Science Foundation and the Texas Advanced Technology Program. B.L. Evans is supported on a US National Science Foundation CAREER Award under Grant MIP-9702707.
منابع مشابه
Characterization of MMX-enhanced DSP Applications on a General Purpose Processor
We intend to investigate the behavior of MMX optimized applications on an X86 general purpose processor. The applications will be written as C programs. The speci c applications are yet to be determined, but will be chosen carefully to represent the behavior of DSP applications as a whole. We will analyze the instruction mix and memory access behavior of both applications with MMX instrucions a...
متن کاملProgrammable VLIW and SIMD Architectures for DSP and Multimedia Applications
Digital Signal Processing (DSP) and multimedia workloads are expected to be the dominant workloads on future computer systems. This is true in both low cost embedded applications that use specialized microprocessors like DSPs and in the generalpurpose processor market. Very Long Instruction Word (VLIW) architectures have multiple functional units to take advantage of vastly available Instructio...
متن کاملUltra-Low-Energy DSP Processor Design for Many-Core Parallel Applications
Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...
متن کاملEvaluating VLIW and SIMD Architectures for DSP and Multimedia Applications
Digital signal processing (DSP) and multimedia applications are expected to be the dominant workloads on future computer systems. In this paper, we evaluate the performance of a very long instruction word (VLIW) processor using Texas Instruments Inc.’s TMS320C6x and a single-instruction multiple-data (SIMD) processor using Intel’s Pentium II processor (with MMX) on a set of benchmarks. Our benc...
متن کاملPerformance Evaluation and Benchmarking of Native Signal Processing
DSP processor growth is phenomenal and continues to grow rapidly, but general-purpose microprocessors have entered the multimedia and signal processing oriented stream by adding DSP functionality to the instruction set and also providing optimized assembly libraries. In this paper, we compare the performance of a general-purpose processor (Pentium II with MMX) versus a DSP processor (TI’s C62xx...
متن کامل